Automatic Text Simplification for German
نویسندگان
چکیده
The article at hand aggregates the work of our group in automatic processing simplified German. We present four parallel (standard/simplified German) corpora compiled and curated by group. report on creation a gold standard sentence alignments from sources for evaluating alignment methods this standard. show that one performs best majority data sources. used two as basis first sentence-based neural machine translation (NMT) approach toward simplification In follow-up work, we extended model to render it capable explicitly operating multiple levels using source-side language level labels improves performance with regard evaluation metrics commonly applied measuring quality text simplification.
منابع مشابه
Building a German/Simple German Parallel Corpus for Automatic Text Simplification
In this paper we report our experiments in creating a parallel corpus using German/Simple German documents from the web. We require parallel data to build a statistical machine translation (SMT) system that translates from German into Simple German. Parallel data for SMT systems needs to be aligned at the sentence level. We applied an existing monolingual sentence alignment algorithm. We show t...
متن کاملAutomatic induction of rules for text simplification
Long and complicated sentences pose various problems to many state-of-the-art natural language technologies. We have been exploring methods to automatically transform such sentences as to make them simpler. These methods involve the use of a rule-based system, driven by the syntax of the text in the domain of interest. Hand-crafting rules for every domain is time-consuming and impractical. This...
متن کاملAutomatic Text Simplification via Synonym Replacement
In this study automatic lexical simplification via synonym replacement in Swedish was investigated using three different strategies for choosing alternative synonyms: based on word frequency, based on word length, and based on level of synonymy. These strategies were evaluated in terms of standardized readability metrics for Swedish, average word length, proportion of long words, and in relatio...
متن کاملAutomatic Text Simplification via Synonym Replacement
Automatic lexical simplification via synonym replacement in Swedish was investigated. Three different methods for choosing alternative synonyms were evaluated: (1) based on word frequency, (2) based on word length, and (3) based on level of synonymy. These three strategies were evaluated in terms of standardized readability metrics for Swedish, average word length, and proportion of long words,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Frontiers in Communication
سال: 2022
ISSN: ['2297-900X']
DOI: https://doi.org/10.3389/fcomm.2022.706718